Restoring 2D content from distorted documents
Identifieur interne : 000F25 ( Main/Exploration ); précédent : 000F24; suivant : 000F26Restoring 2D content from distorted documents
Auteurs : Michael S. Brown [Singapour] ; MINGXUAN SUN [États-Unis] ; RUIGANG YANG [États-Unis] ; LIN YUN [États-Unis] ; W. Brent Seales [États-Unis]Source :
- IEEE transactions on pattern analysis and machine intelligence [ 0162-8828 ] ; 2007.
Descripteurs français
- Pascal (Inist)
- Intelligence artificielle, Analyse forme, Eclairement, Texte, Formation image, Reconnaissance optique caractère, Reconnaissance caractère, Haute résolution, Résolution image, Traitement image, Image tridimensionnelle, Traitement document, Caractère imprimé, Document imprimé, Luminance, Surface plane, Structure document, Transformation géométrique, Transformation conforme, Gradient, Facteur réflexion, Artefact, Source lumineuse.
- Wicri :
- topic : Intelligence artificielle.
English descriptors
- KwdEn :
- Algorithms, Artefact, Artificial Intelligence, Artificial intelligence, Automatic Data Processing (methods), Character recognition, Computer Graphics, Conformal transformation, Document processing, Document structure, Documentation (methods), Geometric transformation, Gradient, High resolution, Illumination, Image Enhancement (methods), Image Interpretation, Computer-Assisted (methods), Image processing, Image resolution, Imaging, Information Storage and Retrieval (methods), Light source, Luminance, Numerical Analysis, Computer-Assisted, Optical character recognition, Pattern Recognition, Automated (methods), Pattern analysis, Plane surface, Printed character, Printed document, Reflectance, Reproducibility of Results, Sensitivity and Specificity, Signal Processing, Computer-Assisted, Subtraction Technique, Text, Tridimensional image, User-Computer Interface.
- MESH :
- methods : Automatic Data Processing, Documentation, Image Enhancement, Image Interpretation, Computer-Assisted, Information Storage and Retrieval, Pattern Recognition, Automated.
- Algorithms, Artificial Intelligence, Computer Graphics, Numerical Analysis, Computer-Assisted, Reproducibility of Results, Sensitivity and Specificity, Signal Processing, Computer-Assisted, Subtraction Technique, User-Computer Interface.
Abstract
-This paper presents a framework to restore the 2D content printed on documents in the presence of geometric distortion and nonuniform illumination. Compared with text-based document imaging approaches that correct distortion to a level necessary to obtain sufficiently readable text or to facilitate optical character recognition (OCR), our work targets nontextual documents where the original printed content is desired. To achieve this goal, our framework acquires a 3D scan of the document's surface together with a high-resolution image. Conformal mapping is used to rectify geometric distortion by mapping the 3D surface back to a plane while minimizing angular distortion. This conformal "deskewing" assumes no parametric model of the document's surface and is suitable for arbitrary distortions. Illumination correction is performed by using the 3D shape to distinguish content gradient edges from illumination gradient edges in the high-resolution image. Integration is performed using only the content edges to obtain a reflectance image with significantly less illumination artifacts. This approach makes no assumptions about light sources and their positions. The results from the geometric and photometric correction are combined to produce the final output.
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000317
- to stream PascalFrancis, to step Curation: 000469
- to stream PascalFrancis, to step Checkpoint: 000267
- to stream Main, to step Merge: 000F38
- to stream PubMed, to step Corpus: 000058
- to stream PubMed, to step Curation: 000058
- to stream PubMed, to step Checkpoint: 000058
- to stream Ncbi, to step Merge: 000041
- to stream Ncbi, to step Curation: 000041
- to stream Ncbi, to step Checkpoint: 000041
- to stream Main, to step Merge: 000D55
- to stream Main, to step Curation: 000F25
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Restoring 2D content from distorted documents</title>
<author><name sortKey="Brown, Michael S" sort="Brown, Michael S" uniqKey="Brown M" first="Michael S." last="Brown">Michael S. Brown</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>School of Computer Engineering, Nanyang Technological University, Blk N4, 2A-32, Nanyang Avenue</s1>
<s2>Singapore 639798</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 639798</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Mingxuan Sun" sort="Mingxuan Sun" uniqKey="Mingxuan Sun" last="Mingxuan Sun">MINGXUAN SUN</name>
<affiliation wicri:level="2"><inist:fA14 i1="02"><s1>GVU Center, Georgia Institute of Technology, 85 Fifth St. NW</s1>
<s2>Atlanta, GA 30332-0760</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Géorgie (États-Unis)</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Ruigang Yang" sort="Ruigang Yang" uniqKey="Ruigang Yang" last="Ruigang Yang">RUIGANG YANG</name>
<affiliation wicri:level="2"><inist:fA14 i1="03"><s1>Computer Science Department, University of Kentucky, 232 Hardymon Building</s1>
<s2>Lexington, KY 40506</s2>
<s3>USA</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Kentucky</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Lin Yun" sort="Lin Yun" uniqKey="Lin Yun" last="Lin Yun">LIN YUN</name>
<affiliation wicri:level="2"><inist:fA14 i1="04"><s1>Laboratory for Advanced Networking, University of Kentucky, Hardymon Building, 2nd Floor, 301 Rose Street</s1>
<s2>Lexington, KY 40506-0495</s2>
<s3>USA</s3>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Kentucky</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Seales, W Brent" sort="Seales, W Brent" uniqKey="Seales W" first="W. Brent" last="Seales">W. Brent Seales</name>
<affiliation wicri:level="2"><inist:fA14 i1="04"><s1>Laboratory for Advanced Networking, University of Kentucky, Hardymon Building, 2nd Floor, 301 Rose Street</s1>
<s2>Lexington, KY 40506-0495</s2>
<s3>USA</s3>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Kentucky</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">07-0506314</idno>
<date when="2007">2007</date>
<idno type="stanalyst">PASCAL 07-0506314 INIST</idno>
<idno type="RBID">Pascal:07-0506314</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000317</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000469</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000267</idno>
<idno type="wicri:doubleKey">0162-8828:2007:Brown M:restoring:d:content</idno>
<idno type="wicri:Area/Main/Merge">000F38</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="RBID">pubmed:17848773</idno>
<idno type="wicri:Area/PubMed/Corpus">000058</idno>
<idno type="wicri:Area/PubMed/Curation">000058</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000058</idno>
<idno type="wicri:Area/Ncbi/Merge">000041</idno>
<idno type="wicri:Area/Ncbi/Curation">000041</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000041</idno>
<idno type="wicri:doubleKey">0162-8828:2007:Brown M:restoring:d:content</idno>
<idno type="wicri:Area/Main/Merge">000D55</idno>
<idno type="wicri:Area/Main/Curation">000F25</idno>
<idno type="wicri:Area/Main/Exploration">000F25</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Restoring 2D content from distorted documents</title>
<author><name sortKey="Brown, Michael S" sort="Brown, Michael S" uniqKey="Brown M" first="Michael S." last="Brown">Michael S. Brown</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>School of Computer Engineering, Nanyang Technological University, Blk N4, 2A-32, Nanyang Avenue</s1>
<s2>Singapore 639798</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 639798</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Mingxuan Sun" sort="Mingxuan Sun" uniqKey="Mingxuan Sun" last="Mingxuan Sun">MINGXUAN SUN</name>
<affiliation wicri:level="2"><inist:fA14 i1="02"><s1>GVU Center, Georgia Institute of Technology, 85 Fifth St. NW</s1>
<s2>Atlanta, GA 30332-0760</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Géorgie (États-Unis)</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Ruigang Yang" sort="Ruigang Yang" uniqKey="Ruigang Yang" last="Ruigang Yang">RUIGANG YANG</name>
<affiliation wicri:level="2"><inist:fA14 i1="03"><s1>Computer Science Department, University of Kentucky, 232 Hardymon Building</s1>
<s2>Lexington, KY 40506</s2>
<s3>USA</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Kentucky</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Lin Yun" sort="Lin Yun" uniqKey="Lin Yun" last="Lin Yun">LIN YUN</name>
<affiliation wicri:level="2"><inist:fA14 i1="04"><s1>Laboratory for Advanced Networking, University of Kentucky, Hardymon Building, 2nd Floor, 301 Rose Street</s1>
<s2>Lexington, KY 40506-0495</s2>
<s3>USA</s3>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Kentucky</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Seales, W Brent" sort="Seales, W Brent" uniqKey="Seales W" first="W. Brent" last="Seales">W. Brent Seales</name>
<affiliation wicri:level="2"><inist:fA14 i1="04"><s1>Laboratory for Advanced Networking, University of Kentucky, Hardymon Building, 2nd Floor, 301 Rose Street</s1>
<s2>Lexington, KY 40506-0495</s2>
<s3>USA</s3>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Kentucky</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">IEEE transactions on pattern analysis and machine intelligence</title>
<title level="j" type="abbreviated">IEEE trans. pattern anal. mach. intell.</title>
<idno type="ISSN">0162-8828</idno>
<imprint><date when="2007">2007</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">IEEE transactions on pattern analysis and machine intelligence</title>
<title level="j" type="abbreviated">IEEE trans. pattern anal. mach. intell.</title>
<idno type="ISSN">0162-8828</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithms</term>
<term>Artefact</term>
<term>Artificial Intelligence</term>
<term>Artificial intelligence</term>
<term>Automatic Data Processing (methods)</term>
<term>Character recognition</term>
<term>Computer Graphics</term>
<term>Conformal transformation</term>
<term>Document processing</term>
<term>Document structure</term>
<term>Documentation (methods)</term>
<term>Geometric transformation</term>
<term>Gradient</term>
<term>High resolution</term>
<term>Illumination</term>
<term>Image Enhancement (methods)</term>
<term>Image Interpretation, Computer-Assisted (methods)</term>
<term>Image processing</term>
<term>Image resolution</term>
<term>Imaging</term>
<term>Information Storage and Retrieval (methods)</term>
<term>Light source</term>
<term>Luminance</term>
<term>Numerical Analysis, Computer-Assisted</term>
<term>Optical character recognition</term>
<term>Pattern Recognition, Automated (methods)</term>
<term>Pattern analysis</term>
<term>Plane surface</term>
<term>Printed character</term>
<term>Printed document</term>
<term>Reflectance</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
<term>Signal Processing, Computer-Assisted</term>
<term>Subtraction Technique</term>
<term>Text</term>
<term>Tridimensional image</term>
<term>User-Computer Interface</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en"><term>Automatic Data Processing</term>
<term>Documentation</term>
<term>Image Enhancement</term>
<term>Image Interpretation, Computer-Assisted</term>
<term>Information Storage and Retrieval</term>
<term>Pattern Recognition, Automated</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Algorithms</term>
<term>Artificial Intelligence</term>
<term>Computer Graphics</term>
<term>Numerical Analysis, Computer-Assisted</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
<term>Signal Processing, Computer-Assisted</term>
<term>Subtraction Technique</term>
<term>User-Computer Interface</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Intelligence artificielle</term>
<term>Analyse forme</term>
<term>Eclairement</term>
<term>Texte</term>
<term>Formation image</term>
<term>Reconnaissance optique caractère</term>
<term>Reconnaissance caractère</term>
<term>Haute résolution</term>
<term>Résolution image</term>
<term>Traitement image</term>
<term>Image tridimensionnelle</term>
<term>Traitement document</term>
<term>Caractère imprimé</term>
<term>Document imprimé</term>
<term>Luminance</term>
<term>Surface plane</term>
<term>Structure document</term>
<term>Transformation géométrique</term>
<term>Transformation conforme</term>
<term>Gradient</term>
<term>Facteur réflexion</term>
<term>Artefact</term>
<term>Source lumineuse</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Intelligence artificielle</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">-This paper presents a framework to restore the 2D content printed on documents in the presence of geometric distortion and nonuniform illumination. Compared with text-based document imaging approaches that correct distortion to a level necessary to obtain sufficiently readable text or to facilitate optical character recognition (OCR), our work targets nontextual documents where the original printed content is desired. To achieve this goal, our framework acquires a 3D scan of the document's surface together with a high-resolution image. Conformal mapping is used to rectify geometric distortion by mapping the 3D surface back to a plane while minimizing angular distortion. This conformal "deskewing" assumes no parametric model of the document's surface and is suitable for arbitrary distortions. Illumination correction is performed by using the 3D shape to distinguish content gradient edges from illumination gradient edges in the high-resolution image. Integration is performed using only the content edges to obtain a reflectance image with significantly less illumination artifacts. This approach makes no assumptions about light sources and their positions. The results from the geometric and photometric correction are combined to produce the final output.</div>
</front>
</TEI>
<affiliations><list><country><li>Singapour</li>
<li>États-Unis</li>
</country>
<region><li>Géorgie (États-Unis)</li>
<li>Kentucky</li>
</region>
</list>
<tree><country name="Singapour"><noRegion><name sortKey="Brown, Michael S" sort="Brown, Michael S" uniqKey="Brown M" first="Michael S." last="Brown">Michael S. Brown</name>
</noRegion>
</country>
<country name="États-Unis"><region name="Géorgie (États-Unis)"><name sortKey="Mingxuan Sun" sort="Mingxuan Sun" uniqKey="Mingxuan Sun" last="Mingxuan Sun">MINGXUAN SUN</name>
</region>
<name sortKey="Lin Yun" sort="Lin Yun" uniqKey="Lin Yun" last="Lin Yun">LIN YUN</name>
<name sortKey="Ruigang Yang" sort="Ruigang Yang" uniqKey="Ruigang Yang" last="Ruigang Yang">RUIGANG YANG</name>
<name sortKey="Seales, W Brent" sort="Seales, W Brent" uniqKey="Seales W" first="W. Brent" last="Seales">W. Brent Seales</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000F25 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000F25 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:07-0506314 |texte= Restoring 2D content from distorted documents }}
This area was generated with Dilib version V0.6.32. |